Introduction

I miss the old Kanye… “I Love Kanye” - Kanye West (2016)

But who, what & when is this ‘old’ Kanye? Is it possible to uncover who the old Kanye is by analyzing his music? Are there any significant differences in his earlier and later works, and is there a way to cluster these works so we have an ‘old’ Kanye and a ‘new’ Kanye?

To find an answer to these questions, we will build a corpus consisting of the 9 studio albums released by Kanye West and his 2 collaborative albums. Any non-musical tracks, e.g. skits, featured on the albums will be included. If a distinction can be made between old Kanye and new Kanye, we expect there to be musical differences between his first three albums and his latter releases. This is where the music changed from cheery & soulful to dark & electronic-inspired, so we expect to see differences in valence as this indicates mood and timbre coefficients as these indicate the tone color. His latest album, Jesus Is King (2019), is an outlier concerning this stylistic change and could mean that Kanye West is returning to his original sound, or that potentially a ‘future’ Kanye is in the making.

The 2 collaborative albums (with Jay-Z and Kid Cudi, respectively) are included as I feel that Kanye did have enough of an influence during the creation of these works so his personality should be reflected in the music, although it might be for only 50%. Singles and tracks where Kanye was a featured artist are excluded from the corpus.

First, we will try to find some useful features we could use for our clustering algorithm using Spotify’s API. Afterwards, we will investigate other tools to compare musical pieces and finally we will implement our clustering algorithm which will be used to discover which albums sound the most similar.

Seperating albums on their mood using valence


As mentioned in the introduction, we need to find out which audio features contain the most useful information for our corpus.

One of such features is valence, which Spotify describes as the musical positiveness. To the left is a plot of the distribution of the valence for each album. We can see that his first three albums sound positive (valence > 0.5) and the latter sound negative (valence < 0.5). For this reason, valence seems to be a good feature as there is a clear distinction between Kanye’s early and newer work.

Seperating albums on their tempo using BPM


Another feature with similar behavior is the tempo. From the Spotify API, we can find the tempo in beats per minute (BPM) for each track. From the plot of the distribution of these tempi, we see that again the first three albums are distinct from the rest of Kanye’s work. There is one exception for his latest album ’JESUS IS KING. These albums all have an average BPM of < 100 whilst the rest of his albums all have an average BPM of > 100

The BPM used for hip-hop beats is generally around 60-100 bpm. This indicates that his early and newest work are typical Hip Hop albums but the other albums aren’t.

Seperating albums on their emotional level using valence and energy


Whereas valence as a standalone feature tells us something about the general mood of a song, e.g. positive or negative, we could combine the valence with the energy feature. This pair of parameters now tells us something about the emotional level of a track. Low energy + low valence songs are considered ‘sad’, low valence + high energy are ‘angry’, songs with high valence + low energy are ‘calm’ and high valence + high energy are ‘happy’.

Using this new emotion feature, we can see that most of Kanye’s early work is classified as Happy, his newest work is mostly Sad and the albums released in between are Angry.

Using Spotify’s lower-level track audio features to find differences in Kanye’s albums


We already saw from the previous pages that we were able to distinguish Kanye’s work using Spotify’s audio features like valence, energy & tempo. These features are quite high-level, meaning that Spotify already did a lot of work to obtain these features. It is also possible to obtain lower level features from Spotify’s API, which might also add extra information when trying to cluster albums into groups.

We are going to compare two albums, Yeezus & The College Dropout (TCD). These albums are, according to analysis using the higher level features, the most contrasting works from Kanye’s discography. For this comparison we will look at the average timbre coefficients and the average pitch classes. From the two distributions of these features, we can actually see that these two albums are not as distinct as we might have thought.

For the timbre coefficients we see that they are distributed similarly for both albums. There is an exception for coefficient c05 where the mean is a bit higher for TCD compared to Yeezus, and there is one for coefficient c07 where the distribution for Yeezus is left-skewed, and vice versa. For the pitch classes, there is a bit more distinction between the two albums. Especially the variance of the pitch classes, which is significantly larger for Yeezus than for TDC, could potentially serve as an extra variable when trying to cluster Kanye’s albums.

It seems that we could use these timbre coefficients and pitch classes as extra features for our clustering algrithm, because they could serve both as a way to group albums together and as a way to distinguish albums.

Can we use Spotify’s tempo feature for classification or clustering?


In our previous work (Seperating albums on their tempo using BPM), we identified Spotify’s tempo feature as a possible candidate to distinguish Kanye’s albums. We also identified a couple of outliers, which only occurred for his slower & early work (and his latest album, which indicated a new change of styles). We will inspect the two worst outliers using a tempogram. These outliers are “I Wonder” from the album “Graduation” and “Follow God” from the album “Jesus Is King”.

For our first song, “I Wonder”, Spotify labeled the tempo as 191.385 BPM. This is actually the highest for our entire corpus, and differs greatly from the mean tempo of the album it is featured on which is 80 BPM. Listening to the track, we don’t get the idea that this BPM is correct and neither that Spotify should have a hard time measuring the BPM. Upon inspecting the “Fourier-based” and “Cyclic” tempograms, we see that indeed Spotify was able to produce clear plots. However, it prefers to label the BPM according to the “Fourier-based” tempogram which picks up strongly on tempo harmonics. The correct BPM should be the sub-harmonic, 96 BPM

We see the same behavior for our other outlier “Follow God”. Spotify incorrectly labeled the BPM as 180 BPM, with the correct being its sub-harmonic 90 BPM.

These incorrect labels will hurt the performance of a classifying/clustering algorithm, as these outliers are more likely to be assigned to the incorrect groups. Therefore, we should correct the tempo features for such outliers using tempograms before we apply our clustering algorithm .

Comparing the structure of the opening tracks of My Beautiful Dark Twisted fantasy and Yeezus


Another way to compare Kanye’s music is to look at the way he structured his songs using self-similarity matrices (SSMs). Lets have a look at the opening tracks of his 5th album My Beautiful Dark Twisted Fantasy (2010) and his 6th album Yeezus (2013).

My Beautiful Dark Twisted Fantasy (MBDTF) features a maximalist aesthetic & clean production quality. Wikipedia classified the album as Hip hop / prog-rap/ popart/ pop / rap opera. If we think about most of these genres, we think of a clearly structured song. The opening track for MBDTF is self titled ‘Dark Fantasy’ and checks this box.

From both the chroma and the timbre matrix, we can see the structure of the song. After a short intro, we start with a prolonged version of the chorus. From the diagonal lines in the chroma matrix, we can clearly see that this chorus itself is repeating a couple of bars. Following this we have two verses from Kanye with a shorter version of the chorus in between. After the 3rd chorus, we have a small bridge section and another short version of the chorus. The track then ends with an instrumental loop and after a brief silence + sharp sound (clearly depicted in the timbre matrix), we encounter the long version of the chorus one last time.

Comparing the structure of the opening tracks of My Beautiful Dark Twisted fantasy and Yeezus cont.


Contrasting his previous release, Kanye’s approach for Yeezus is in favor of a more minimalist approach. The album has been characterized as West’s most experimental and sonically abrasive work. Because of the great reviews for his previous album, expectations were sky high for this release. But, instead of trying to one up his previous masterpiece he does a complete 180. Yeezus is a divisive album, and clearly does not fit in the ‘Old Kanye’ box. Lets have a look at the opening track ‘On Sight’.

It is very hard to see a clear structure from the matrices, and every line is very blurry. This could be because of the way the track is produced, very compressed and industrial sounding. In the chroma matrix we do see the interlude clearly, starting at around 75 seconds into the song, but is less visible in the timbre matrix. Vice versa, the outro which starts at around 120 seconds into the song is visible in the timbre matrix, but not in the chroma matrix.

This indicates that we could also distinguish Kanye’s work using a different approach. But, as we are most interested in clustering his albums using a clustering algorithm, we choose to stick to the audio features mentioned in the previous tabs.

Clustering ‘old’ & ‘new’ Kanye using Spotify’s selected high level features


So, lets find out if there is any distinction between an ‘old’ & ‘new’ Kanye. From our previous work, we found that Spotify’s low level features such as tempo, valence & energy looked like good variables for a clustering algorithm. Using the mean of these features for each album, (as we are interested in grouping albums together and not particular tracks), we create our clusters using hierarchical clustering which can be seen in the dendogram.

If we look at the top level, where we get two clusters, we find that indeed his early work is grouped together. But we also find his latest album, ‘JESUS IS KING’ in this cluster. This indicates that Kanye West is returning to his ‘old’ self, and that this term does not refer to a period in time, but rather a typical sound.

There are some other interesting observations when looking at lower levels of the dendogram. For instance, his collaborative album ‘Watch The Throne’ is an outlier in the sense that it is part of a rather vague & large cluster ‘new Kanye’, but none other. This makes sense for the reason that it is not completely his own work and upon listening to the album, it also does not sound like a Kanye West album.

Other smaller sub-clusters from the ‘new Kanye’ clusters are:

  1. His emotional, sad albums: ‘ye’ + ‘808s and hearthbreak’
  2. His experimental, industrial album: ‘Yeezus’
  3. His more poppy, less thematic albums: ‘The Life of Pablo’ + ‘KIDS SEE GHOSTS’ + ‘My Beautiful Dark Twisted Fantasy’

Clustering ‘old’ & ‘new’ Kanye using Spotify’s selected high level and low level features


We also saw in the previous tabs that we could include Spotify’s lower level features as variables. When adding the average timbre coefficients and average pitch classes for each album to the clustering algorithm, we are obtaining completely different clusters compared to the clusters created when just using the higher level features.

For our first of the two main clusters, we can see that only the first two albums of the ‘Higher Education’ trilogy remain together, and that his 3rd album ‘Graduation’ is now moved to a cluster quite far from the other two. Taken its place, we have his 8th album ‘ye’ which sounds nothing like the first two albums.

For all subclusters, there is not much of an underlying theme and I feel like the previous clusters were more representative.

Conclusion

The purpose of this project was to find possible clusters of albums for the discography of Kanye West. Using Spotify’s API we found suitable features like valence, tempo and energy. Using these features, we found suitable clusters which distinguished his early work from his newer work, whilst his latest album actually belonged to this early cluster. This indicates that Kanye West might be returning to his older self.

We also saw that Spotify’s lower level features such as timbre coefficients and pitch classes could be used to for a clustering algorithm. When including these to the clustering algorithm, we actually found clusters that did not make much sense. This is probably because whilst there are some differences between albums for these features, they are not representative for the albums’ sound.

Concluding, we were able to find an Old Kanye West and a New Kanye West. These labels are mostly related to the emotional level and the tempo of his music. As his latest album can be seen as Old Kanye, I’m eager to find out if his upcoming work is keeping up with this trend, or if we are entering a completely new era of Kanye West.